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@ Improved recombinant expression method, vector and transformed celts. 

@ A method for continuous production of a desired heterolo- 
gous protein comprising constructing an expression vector 
having a stabilizing sequence downstream of a promoter and 
upstream of the DNA encoding tiie desired tieterologous 
protein, transfecting and choosing a particular eukaryotic host 
cell for said continuous production and culturing the trans- 
formed eukaryotic iiost cell under conditions favorable for 
continuous production of said desired heterologous protein. 
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Description 

IMPROVED RECOMBINANT EXPRESSION METHOD. VECTOR AND TRANSFORMED CELLS 

This invention relates to the application of recombinant DNA technology to prepare vectors capable of 
expressing desired proteins such that continuous production of the protein can be achieved. Furthermore, the 
5 invention relates to the construction of an expression vector capable of generating stable cytoplasmic mRNA 
so as to give rise to continuous production of the desired protein. In another aspect the invention relates to an 
expression vector having a specific stabilizing sequence positioned 5' to a DNA encoding a desired protein. 
The invention further relates to the transfection of eukaryotic cells with such vectors and choosing of a host 
cell such that continuous production of the desired protein by that cell line is established. 

10 Recombinant technology has recently been applied to eukaryotic cells, specifically mammalian cells were 
transformed with heterologous DNA coding for a selectable phenotype. Wigler. < M., et al., Cell 1 1 : 223-232 
(1977). It has also been shown that eukaryotic cells can be transformed to yield transformants having 
heterologous DNA integrated into the chromosomal DNA of the eukaryotic cell nucleus. 
Successful transformation of eukaryotic cell cultures and expression of DNA sequences coding for a 

15 desired protein has been disclosed. See for example. European Patent Publications Nos. 73,659 and 73.656. 
These successful transformations have utilized vectors to express complimentary DNA (cDNA's) requiring 
only 6' control signals such as enhancers (Gluzman, Y and Shenk, T. [edsj Enhancers and Eukaryotic Gene 
Expression [Cold Spring Harbor Laboratory, 1983]). promoters (Hamer. D, H. et al.. Cell g^, 697 [1980]) and 3' 
polyadenylation sites (Proudfoot. N.J. and Brownlee, G.G.. Nature 2B3, 21 1 [1976]). 

20 In 1977 it was found that in eukaryotes the cytoplasmic mRNA is not always co-linear with the DNA. DNA 
sequences encoding proteins were found to be interrupted by stretches of non-coding DNA. There are long 
stretches of base sequence in the DNA of the gene which do not appear in the final mRNA. It was observed 
that the primary mRNA transcripts were "spliced" to remove the non-coding sequences, i.e. sequences which 
do not encode a protein. These non-coding sequences in DNA are generally referred to as introns (formerly 

25 referred to as Intervening sequences) while the coding sequences are known as exons. RNA polymerase 
makes a primary transcript of the entire DNA. both exons and introns. This transcript was processed so that 
the introns were removed while at the same time the exons were all joined together in the correct order. The 
mechanism producing the foregoing result is referred to as "splicing." 
Numerous split or spliced genes have been discovered. In fact, introns exist in virtually all mammalian and 

30 vertebrate genes and also in the genes of eukaroytic microorganisms. Introns are not limited to the coding 
region of a message. For example, one intron was found in the leader region of the plasminogen activator 
mRNA before the coding sequence in addition to multiple splice sites elsewhere in the gene. Fisher, R. et a!., J. 
Biol. Chem. 260, 1122 (1985). There has been considerable speculation about why Introns have evolved and 
become such a general feature of eukaryotic genes. Crick. F.. Science 204 . 264, 1979; and. Sharp, P. A., Cell 23, 

35 643-646 (1981). 

Given the ubiquity of introns. it is not surprising that splicing was studied in the context of recombinant 
technology. For example, an SV40 vector was constructed containing a rabbit p-globin cDNA. regions 
implicated in transcription initiation and termination, splice sites from a multipartite leader sequence located 5' 
to the p-globin cDNA sequence and a polyadenylation sequence. Mulligan. R.C. et al., Nature 277. 108-114 

40 (1979). This recombinant genome, when infected into monkey kidney cells, was found to produce hybrid 
mRNAs containing the leader region for the 16S and 19S late RNA and the p-globin coding sequence. This 
hybrid mRNA produced substantial quantities of the rabbit p-globin polypeptide. Mulligan et al. discuss an 
experiment in which mutants lacking splicing capability failed to produce discrete mRNAs. Id. at 109. 
In an attempt to establish the physiological role that RNA splicing plays in gene expression. Hamer, D.H. and 

45 Leder. P., Cell 18. 1299-1302 (1979) manipulated the location and/or presence of a splice site in SV40 
recombinants transfected into monkey cells. Hamer and Leder. supra , used one splice site located within the 
^gene encoding the desired protein or used two splice site sequences, one located 5' to and the second within 
the gene encoding the desired protein. They found that RNA were produced transiently by all of the viruses 
that retain at least one functional splice junction. They concluded that splicing is a prerequisite for stable RNA 

50 formation. Confirming that result. Gruss. P. et al. PNAS (USA), 76, 4317-4321 (1979) found that construction of 
an SV40 mutant lacking an intervening sequence made no detectable capsid protein. The Gruss paper utilized 
a multipartite leader having several splice site sequences. The three papers discussed all utilize viral vectors 
with numerous splice sites at various locations. These viral vectors differ from the nonvlral vectors of the 
instant invention in several respects. First viral introns are located both 5' and 3' to the transcription unit as 

55 well as within the coding sequence itself. In the instant invention the stabilizing sequence is located 5' to the 
gene encoding the desired protein. Viral vectors continue to replicate independent of the host DNA, do not 
integrate and are lytic. Finally, many viral vectors require early gene function for correct splicing to occur. 

These two papers suggest that RNA splicing may be important in a recombinant milieu. However, other 
studies abandoned splicing to express proteins using only 5' control signals such as enhancers, and 

60 promoters and 3' polyadenylation sites. In fact, recent work by Reddy, U.B. et al.. Transcriptional Control 
Mechanisms. J. Cell. Blochem. Suppl. 10D , 154 (1986), found that the inclusion of introns in an expression 
vector actually reduced the amount of the desired protein expressed. The authors concluded that introns were 
not an essential part of vectors for the expression of a desired protein. Hall et al. also observed that including 
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an intron was detrimental to protein xpression. It was observed that deletion of the acceptor sequence 
result d In transient production of unspliced cytoplasmic viral mRNA. Treismann, R. et al. Nature ^2, 695-600 
(1981). These results support the notion that splicing is not obligatory. 

Straightforward expression using standard recombinant control signals such as enhancers, promoters and 
3' polyadenylatton sites cannot always be achieved. The SV40 promoter without a splice site has been used to 5 
direct expression of numerous cDNAs/ (P-galactosidase. Hall. C.V. et al. J. Mol. Applied Genetics 2,; human 
interferon, Gray, P.W„ et al., Nature 295, 503 (1982); hemagglutinin, Gething. et al. Nature 293, 620 (1981); 
human lecithin-cholesterol acyltransferase. McLean, J. et al., PNAS 33, 2335 (1986); DHFR, SImonsen. C.C. et 
al., PNAS 80, 2495 (1983) ; human interleukin-2. Leonard. W.T. et al.. Nature 3n. 626 (1984) ; ras-2, Capon, D.J. 
et al. Nature 304, 1983; src, Snyder, M.A. et al.. Cell 32, 891 (1983); and hepatitis B surface antigen, Crowley, 10 
C.W. et a!., Mol. Cell Biol. 3, 44-55 (1983)). However, no discrete factor VIII message of correct size was 
detected using an expression vector comprising an SV40 promoter ligated to a cDNA encoding factor VIII 
transfected into a variety of ceils. Transcription of other genes present on the same plasmid, such as DHFR. 
did produce the correct message. Since the SV40 promoter could express mRNA for certain proteins but not 
factor VIII. the problem was identified as relating to either transcription/splicing of a mRNA that would permit 15 
continuous expression or simply a lack of accumulatioin of the factor VIII message. The former problem is 
referred to herein as one of the 'stability** of the mRNA. 

Numerous experiments using various combinations of transcriptional start signals with the cDNA encoding 
factor VIII were tried. Cells transfected with such vectors were analyzed for factor VIII message by Northern 
analysis. No discrete message of the correct size was found. 20 

Experiments were also conducted with introns and splice sites present in the vectors. Okayama, H. and 
Berg. P.. Mol. and Cell. Bid. 3(2) : 280-289 (1983) utilize a plasmid vector, pcD, containing an SV40 early region 
promoter. SV40 late region intron comprising one donor site and two acceptor sites, cDNA and a 
polyadenylation signal. A vector comprising the adenovirus major late promoter and tripartite leader, having 
three splice sites and a cDNA encoding factor VIII was constructed as described in European Patent 25 
Publication No. 160.457. This vector was analyzed and found to be randomly successful In directing expression 
of full length factor VIII. This could be explained in part by cryptic splicing. The tripartite leader region is spliced 
onto multiple coding regions to yield a final message. The complexity of the splicing pattern is evident from the 
fact that 4 primary transcripts can be differentially spliced to yield 14 discrete messages. Nevlns. J. and Wilson, 
M., Nature 290. 113 (1981). The controls for selection of downstream splicing to the coding sequence is not 30 
well understood. However, selection of the appropriate polyadenylation site and transcription termination 
precede the final splicing event and may effect the selection of the 3' splice site. For these reasons and 
because the Information content of the base sequences at exon-intron junctions is relatively small it is not 
surprising that splicing is sometimes inconrect, Le. cryptic. Hamer et al., Cell 21, 697-708 (1980) and Mansour 
et al.. Mol. Cell. Biol. 6, 2684 (1986). Cryptic splicing could explain the random success in expressing full length 35 
fector VIII using the adenomajor late promoter and tripartite leader. 

Further analyses of vectors containing the adenomajor late promoter was conducted. Adenovectors had 
been used to express other proteins but with a restricted expression pattem suggesting that the adeno 
control regions could function in a limited number of cell types. Levine, A.S. et al.. Virol. 11^. 672-681 (1973) and 
Grodricker. T.J. et a[., J. Virol. 9, 559-571 (1976). Vectors were constructed using cDNA*s from other proteins 40 
such as DHFR or t-PA with the identical 5' and 3' control regions as described in European Patent Publication 
No. 160,457. Following transfectron of these plasmids into Cos, 293. BHK and CHO cells the transfectants were 
monitored for either t-PA expression by immunoperoxidase staining or DHFR expression using methotrexate. 
In summary, at no time were any of these adeno late vectors found capable of expressing t-PA or DHFR in any 
cell types other than 293 or Cos cells. Transient expression of t-PA was reproducibly seen In 293 or COS cells, 45 
however, factor VII! expression was random under the identical conditions. These results were confirmed in 
three papers in which the use of a portion of a viral multipartite leader sequence failed to express the desired 
protein. In the first paper, Kaufman, R.J. and Sharp. P.A., Mol. Cell. Biol. 2(11). 1304 (1982) constmcted a 
vector containing the adenomajor late promoter, including the first leader and 5' splice donor site of the 
adenovirus tripartite mRNA leader sequence, adjoined to two 3' splice site acceptor sequences Isolated from 50 
an immunoglobulin variable-region gene and the DHFR coding region. This vector transfected into DHFR- 
CHO cells produced a very low frequency of DHFR+ cells. In a second paper Kaufman, R.J. and Sharp, P.A.. J. 
Mol. Biol. 159. 601-621 (1982) described the same plasmid and indicated that expression of DHFR was not 
obtained. Id. at 606, Wong, G.C. et al., Science 228: 810-815 (1985) use an expression vector having: an SV40 
enhancer; the adenovirus major late promoter and tripartite leader sequences; a hybrid intron consisting of a 55 
5' splice site from the first exon of the tripartite leader and a 3' splice site from a mouse immunoglobulin gene; 
two cDNAs the first encoding a desired protein, colony stimulating factor, and the second DHFR; SV40 
polyadenylation sequence; and, VA gene. This polycistronic vector was found to work only transiently, supra at 
810, required the presence of VA RNA to increase translatability, supra at 811, and required a second cDNA, 
that of DHFR, to increase mRNA stability, supra at 811. So while a restricted transient expression capability 60 
was seen with adenovirus major late vectors which included the entire tripartite leader for some proteins, 
certain proteins have additional requirements for successful continuous expression. 

A vector was constructed containing a cytomegalovirus promoter and enhancer, a cDNA encoding factor 
Vlli. and a 3' terminating sequence, absent any intron or constructed splice site. Neither transient nor stable 
expression of factor VIM was observed in any of the cell types tested, 65 
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Another vector absent an intron or constructed splice site containing the SV40 early transcriptional 
s quences including the enhancer and promoter. cDNA encoding factor VIII and the SV40 polyadenylation site ^ 
produc d neither transient nor stable expression. 

A vector containing the SV40 promoter and enhancer, the entire adenomajor late tripartite leader i.e. three 
5 introns with appropriate donor and acceptor sites. cDNA encoding factor VIII and the 3' hepatitis surface 
antigen polyadenylation site produced transient expression of factor Vlli only in COS cells but no other ceil 
types. 

A vector containing the SV40 enhancer and promoter, the first intron of the adenomajor iate tripartite leader, 
an immunoglobulin (!g) variable region acceptor site, cDNA encoding factor VIII, and the SV40 polyadenylation 
10 site expressed factor VIII transiently in COS cells but produced no expression in other cell types. 

A vector was constructed containing an SV40 enhancer and promoter, the first donor site and intron of the 
J adenomajor tripartite leader, the consensus sequence for the Ig variable region acceptor sequence, the cDNA 
encoding factor VIII and the 3' polyadenylation site of the hepatitis surface antigen. This vector failed to provide 
transient or stable expression of express factor VIII. 
15 Yet another vector was constructed comprising the SV40 enhancer and promoter, cDNA encoding factor 
VI II, an SV40 small t-antigen intron 3' to the cDNA, complete with donor and acceptor sequence and the SV40 
early region polyadenylation site. This vector failed to produce either transient or stable expression of factor 
VIII in any cell type. 

Experiments described herein establish that a stabilizing sequence, either a donor-intron-acceptor 
20 sequence or an engineered splice sequence, is necessary for stable expression of certain proteins. Those 
experiments further establish that location of the stabilizing sequence is important for stable continuous 
expression. The present invention is directed to the construction and use of vectors having a specific 
stabilizing sequence positioned 5' to the DNA encoding certain proteins that are difficult to express. The 
expression vector of the instant Invention when transfected into a selected host cell will transfrom that host 
25 cell to one that provides continuous production of a desired protein, e.g. factor VIII. The invention is also 
directed to the choice of an appropriate cell line and transfectlon of that host cell to establish a cell line for 
continuous production of the desired protein. 

The present invention is based on the discovery that continuous production of some proteins by use of a 
recombinant expression vector requires a particular arrangement of a stabilizing sequence, located 5' to the 
30 DNA encoding the desired protein. Furthermore, the Invention relates to a stable cytoplasmic mRNA resulting 
from use of a stabilizing sequence positioned 5* to a DNA encoding a desired protein. In another aspect the 
Invention is directed to the expression vectors constructed In accord with the foregoing which express the 
gene encoding the desired heterologous protein. 
In still another aspect the invention relates to the choice of an appropriate host ceil for transfectlon with the ^ 
35 novel vector of the instant Invention. Yet another aspect of the instant invention is the transformation of a host 
cell to establish a stable cell line for production of the desired heterologous protein. 

Figure 1 Construction of a factor VIII expression vector used to establish production cell lines for factor ^ 
VIII.pFSCIS. 

Figure 2 Construction of a factor VIII expression vector used to establish production cell lines for factor 
40 Vlll.pF8SCIS. 

Figure 3 Immunoperoxidase staining of cells following transfectlon (A) shows expression following 
transfectlon with pF8CIS (B) shows expression following transfectlon with pFSSCIS. 

Figure 4 Immunoperoxidase staining of CHO ceils transfected with pFBCIS subject to one round of 
amplification. 

45 Figure 5 immunoperoxidase staining of CHO cells transfected with pFSCIS and subjected to three 

rounds of amplification. 

Figure 6 Construction of a factor VIII variant expression vector used to establish production cell lines 
for the factor VIII variant. pF8CIS9080. 

Figure 7 immunoperoxidase staining of the cells transfected with the vector pF8CIS9080 encoding the 
50 factor VHI variant or fusion protein. 

Figure 8 Immunoperoxidase staining of CHO cells transfected with pFSCIS subjected to continuous 
amplification. 

Figure 9 Construction of an expression vector containing a cDNA encoding factor VIII resistant to 
proteolytic cleavage by activated protein C. pF8CIS-336E. 
55 Figure 10 Construction of an expression vector containing a cDNA encoding a fusion protein of factor 

VIII resistant to proteolytic cleavage by activated protein C. pF89080-336E. 

Figure 11 SDS-PAGE and Western blot analysis of purified 90kd + 142aa + 80kd fusion. 
Approximately 8 \ig of the 90kd -f 142aa -f 80kd fusion was resolved by SDS-PAGE. Subsequently the 
protein was detected by staining with Coomassie blue (A) or transferred to nitrocellulose for Western blot 
60 analysis (B). A rabbit polyclonal antibody raised against plasma derived factor VIII was used to detect the 

90kd + 142aa + 80kd fusion bound to nitrocellulose. 

Figure 12 Thrombin activation of the 90kd -f 142aa + 80kd fusion. Approximately 11 ^ig of the purified 
90kd +- 142aa + 80kd fusion in 0.05 M Tris, pH 7.4, containing 0.15 M NaCI, 2.5 mM CaCl2 and 5 percent 
glycerol was incubated with 55 ng of thrombin for 0,1 to 60 minutes at 37*'C, At the times indicated an ^ 
65 aliquot was removed, diluted 1/2000-1/10,000 fold in 0.05 M Tris, pH 7.4 containing 0.01 percent BSA and 
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assayed by coagulation analysis. SDS buffer was added to the remainder of the sample which was heated 
to 90** C for 5 min. then subjected to SDS-PAGE (Inset). 

Figure 13 Binding of 90l<d + 142aa + 80kd fusion to vWF is shown. The 90kd + U2aa + 80kd fusion 
(275 units) in 0.05 M Tris, pH 7.4, 150 nM NaCla. 2.5 mM CaCIa and 5 percent glycerol was passed through 
a vWF-Sepharose column and the column was subsequently washed with three column volumes of the 5 
above buffer. The 90kd + 142aa -h 80kd fusion was eluted with 0.26 M CaCl2. The vWF-Sepharose 
column was prepared by coupling pure vWF to Affigel 10 (Bio Rad) according to the manufacturers 
specifications. 

Figure 14 Construction of a prorelaxin expression vector used to establish production cell lines for 
prorelaxin, pCIHRX. , 
Figure 15 Construction of a prorelaxin expression vector used to establish production cell lines for 

prorelaxin. pCISRX. 

Figure 16 Construction of a t-PA expression vector used to establish production cell lines for t-PA. 

pCIHt-PA. . . ^ 

Figure 17 Sequence of a portion of pF8CIS. The DNA sequence of the expression vector containing the 15 
cytomegalovirus enhancer, promoter (nucleotides 1-732). stabilizing sequence, i.e. splice donor intron 
sequence, the ig variable region intron and splice acceptor sequence (nucleotides 733-900). 

Figure 18 Sequence of a portion of pF8SCIS. The DNA sequence of the expression vector containing 
the SV40 enhancer and promoter, (nucleotides 1-360) stabilizing sequence which includes cytomegalovi- 
rus donor and intron sequence, the Ig variable region intron and splice acceptor sequence (nucleotides 20 
361-580). 

Figure 19 Sequence of a portion of pF8CSSS. The DNA sequence of the expression vector containing 
the cytomegalovirus enhancer promoter and leader (nucleotides 1-732). stabilizing sequence including 
the engineered splice donor and acceptor sequence (nucleotides 733-736). the remaining leader. 

Figure 20 Constructions of a t-PA expression vector used to establish production cell lines for t-PA. 25 
pClSt-PA. 

As used herein, •nucleotide sequence* refers to a nucleic acid comprising a series of nucleotides in a 5 to 
3' phosphate diester linkage which may be either an RNA or a DNA sequence. If a DNA. the nucleotide 
sequence may be either single or double stranded. Similarly. "DNA sequence" refers to both single and double 
stranded embodiments. 

"Desired heterologous protein" refers to a protein which is desired to be expressed in a host cell, but which 
the host cell either normally does not produce itself or produces in small amounts, and which is not normally 
necessary for the cells continued existence. Such a protein includes any molecule having the pre or mature 
amino acid sequence and amino acid or glycosylation variants (including alleles) capable of exhibiting a 
biological activity in common with said desired heterologous protein. 35 

'Splicing* refers to the mechanism by which a single functional RNA molecule is produced by the removal of 
one or more internal stretches of RNA during the processing of the primary transcript. Splicing is believed to 
begin with the looping out of the Intron so that the 5' end of the intron (referred to as the donor) is juxtaposed 
to the 3' end of the intron (refen-ed to as the acceptor). A comparison of the base sequences at intron-exon 
junctions reveals consensus sequences, with the first two bases at the 5' end of each Intron being GT and the 40 
last two bases at the 3' end being AG. 

"Spliced mRNA" refers herein to mRNA produced by either the removal of one or more intemal stretches of 
RNA or by constmcting a DNA which when transcribed produces a mRNA having the same properties as a 
mRNA which had been subject to splicing but from which no nucleotide sequence had in fact been removed. 

"Stabilizing sequence" refers to a DNA sequence that gives rise to a spliced mRNA by coding either a splice 45 
donor-intron-acceptor sequence or by coding a sequence comprising a full consensus sequence or a part 
thereof for the donor and acceptor sequence and the appropriate nucleotides at the donor/acceptor junction 
such that the resulting mRNA resembles functionally a mRNA which had been spliced. The stabilizing 
sequence is placed in the leader sequence of the gene encoding the desired heterologous protein. "Leader 
sequence* refers to that region of mRNA that is in the 5' untranslated region between the CAP site and the 50 
AUG translation start signal. 

"Consensus sequence" refers herein to the sequences XAG/GTg AGT found to occur at the exon-mtron 
boundary (or donor sequence) and ( J )nN ^AG/G found to occur at the intron-exon boundary (or acceptor 
sequence). See Mount. S.M.. Nucleic Acids Research 10(2). 459-472 (1982). Analyses of the trequency with 
which individual bases occur in particular positions yielded a consensus sequence for the donor and acceptor 55 
sequences. It is also known that introns begin with GT and end with AG. Breathnach. R. et al.. PNAS (USA) 75, 
4853-4857 (1978). It is also known that certain multipartite leader sequences in which multiple splicing events 
occur may require additional factors of early gene function to achieve proper processing. See Babiss. L.E. et 
al., Mot. and Cell. Biol. 5(10). 2552-2558 (1985). One of ordinary skill in the art using the knowledge of the donor 
and acceptor consensus sequences, multipartite leader sequences in which multiple splicing events occur 
requiring early gene function and the consensus splice sequences rule in accord with the instant invention will 
be able to select a particular stabilizing sequence for a desired protein. 

"Control region" refers to specific sequences at the 5' and 3' ends of eukaryotic genes which may be 
• involved in the control of either transcription or translation. Virtually all eukaryotic genes have an AT-rich region 

located approximately 25 to 30 bases upstream from the site where transcription Is initiated. Another 65 
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s quence found 70 to 80 bas s upstream from the start f transcription is a CXCAAT region wher X may be 
any nucleotide. At \he 3' end of most eukaryotic g nes is an AATAAA sequence which may be the signai for 
addition of the polyadenylation tail to the 3' end of the transcilb d mRNA. 

"Promoter" refers to the nucleotide segment recognized by RISIA polymerase molecules that start RNA 
synthesis. Promoters controlling transcription from vectors in mammalian host celis may be obtained from 
various sources for example, the genomes of viruses such as: polyoma, Simian Virus 40 (SV40), adenovirus, 
retroviruses. hepatitls-B virus and most preferably cytomegalovirus, or from heterologous mammalian 
promoters, e.g. beta actin promoter. The early and late promoters of the SV40 virus are conveniently obtained 
as an SV40 restriction fragment which also contains the SV40 viral origin of replication. Fiers et al., 1978. 
"Nature". 273 : 113. The immediate early promoter of the human cytomegalovirus is conveniently obtained as a 
Hindlll E restriction fragment. Greenaway. PJ. et al.. Gene 18. 355-360 (1982). Of course, promoters from the 
host cell or related species also are useful herein. 
"Enhancer" refers to cis-acting elements of DNA. usually about from 10-300 bp, that act on a promoter to 
. increase its transcription. Transcription of a DNA encoding a desired heterologous protein by higher 
eukaryotes is increased by inserting an enhancer sequence into the vector. Enhancers are relatively 
orientation and position independent having been found 5' (Laimins, L et al., PNAS 78, 993 [1981]) and 3' 
(Lusky, M.L, et al.. Mol. Celi Bio. 3, 1108 [1983]) to the transcription unit, within an intron (Banerji, J.L et al.. 
Ceil 33. 729 [1983]) as well as within the coding sequence itself (Osborne. T.F., et al.. Mol. Cell Bio. 4. 1293 
[1984]). Many enhancer sequences are now known from mammalian genes (globin. elastase, albumin, 
a-fetoprotein and insulin). Typlcaliy, however, one will use an enhancer from a eukaryotic cell virus. Examples 
include the SV40 enhancer on the late side of the replication origin (bp 100-270), the cytomegalovirus early 
promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhahcers. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human or nucleated 
cells from other multicellular organisms) will also contain sequences necessary for the termination of 
transcription which may affect mRNA expression. These regions are transcribed as polyadenylated segments 
In the untranslated portion of the mRNA encoding the desired heterologous protein. The 3' untranslated 
regions also include transcription termination sites. 

Expression vectors may contain a selection gene, also temied a selectable marker. A selection gene 
encodes a protein, sometimes referred to as a secondary protein, necessary for the survival or growth of a 
host cell transformed with the vector. Examples of suitable selectable markers for mammalian cells are 
dihydrofolate reductase (DHFR), thynriidine kinase or neomycin. When such selectable markers are 
successfully transfen-ed into a mammalian host cell, the transformed mammalian host cell can survive if placed 
under selective pressure. There are two widely used distinct categories of selective regimes. The first category 
is based on a cell's metabolism and the use of a mutant cell line which lacks the ability to grow independent of 
a supplemented media. Two examples are: OHO DHFR- cells and mouse LTK- celis. These cells lack the 
ability to grow without the addition of such nutrients as thymidine or hypoxanthine. Because these cells lack 
certain genes necessary for a complete nucleotide synthesis pathway, they cannot survive unless the missing 
nucleotides are provided In a supplemented media. An alternative to supplementing the media is to introduce 
an Intact DHFR or TK gene into cells lacking the respective genes, thus altering their growth requirements. 
Individual cells which were not transfomied with the DHFR or TK gene will not be capable of sun^lval in 
non-supplemented media. Therefore, direct selection of those cells requires cell growrth in the absence of 
supplemental nutrients. 

The second category is dominant selection which refers to a selection scheme used in any cell type and 
does not require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host 
cell. Those cells which have a novel gene would express a protein conveying drug resistance and would 
survive the selection. Examples of such dominant selection use the drugs neomycin. Southern P. and Berg. P., 
J. Molec. Appl. Genet. 1^, 327 (1982), mycophenolic acid, Mulligan, R.C. and Berg, P. Science 209, 1422 (1980) 
or hygromycin, Sugden, B. et al., Mol. Cell. Biol. 5:410-413(1985). The three examples given above employ 
bacterial genes under eukaryotic control to convey resistance to the appropriate drug neomycin (G418 or 
geneticin). xgpt (mycophenolic acid) or hygromycin. respectively. In the following experiments the selective 
agent of choice is most often G418 geneticin unless specifically referring to CHO DHFR- cells. In this case the 
direct selection for DHFR production was used. 

"Amplification" refers to the increase or replication of an isolated region within a cell's chromosomal DNA. 
Amplification is achieved using a selection agent e.g. methotrexate (MTX) which inactivates DHFR. 
Amplification or the making of successive copies of the DHFR gene results in greater amounts of DHFR being 
produced in the face of greater amounts of MTX. Amplification pressure is applied notwithstanding the 
presence of endogenous DHFR. by adding ever greater MTX to the media. Amplification of a desired gene can 
be achieved by cotransfecting a mammalian host cell with a plasmid having a DNA encoding a desired protein 
and the DHFR or amplification gene so that cointegration can occur. One ensures that the cell requires more 
DHFR. which requirement is met by replication of the selection gene, by selecting only for cells that can grow 
in successive rounds of ever-greater MTX concentration. So long as the gene encoding a desired 
heterologous protein has cointegrated with the ampliflable gene, replication of this gene gives rise to 
replication of the gene encoding the desired protein. The result is that increased copies of the gene, i.e. an 
amplified gene, encoding the desired heterologous protein express more of the desired heterologous protein. 

Preferred suitable host cells for expressing the vectors of the instant invention encoding the desired 
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heterologous proteins In higher eukaryotes include: monkey kidney CVI line transformed by SV40 (COS-7, 
ATCC CRL 1651); human embryonic kidney lino (293. Graham, F.L ot al. J. Gen Virol. 36. 59 [1977]); baby 
hamster kidney cells (BHK, ATCC CCL 10); Chinese hamster ovary-cells-DHFR (described by Uriaub and 
Chasin. PNAS (USA) 77, 4216. [1960]); mouse Sertoli cells (TM4, Mather. J.P.. Biol. Reprod. 23, 243-251 
[1980]); monkey kidney cells (CVI ATCC CCL 70); african green monkey kidney cells (VERO-76, ATCC 5 
CRH587): human cervical carcinoma cells (HELA, ATCC CCL2); canine kidney cells (MDCK, ATCC CCL 34); 
buffalo rat liver cells (BRL 3A. ATCC CRL 1442); human lung cells (W138. ATCC CCL 75); human liver cells 
(Hep G2. HB 8065); mouse mammary tumor (MMT 060562. ATCC CCL51); rat hepatoma cells (HTC. M1.54, 
Baumann, H. et al.. J. Cell Biof. 85. 1-8 [1980]); and. TRI cells (Mather, JP. et al.. Annals N.Y. Acad. Scl. 383, 
44-68 [1982]), , w 

"Transformation" means introducing DNA into an organism so that the DNA is replicable. either as an 
extrachromosomal element or by chromosomal Integration. Unless otherwise provided, the method used 
herein for transformation of the host cells is the method of Graham, F. and van der Eb. A„ Virology 52. 456-457 
(1973). ~ 

Host cells may be transformed with the expression vectors of the Instant invention and cultured in 15 
conventional nutrient media modified as is appropriate for inducing promoters, selecting transformants or 
amplifying genes. The culture conditions, such as temperature, pH and the like, are those previously used with 
the host cell selected for expression, and will be apparent to the ordinarily skilled artisan. 

"Transfection" refers to the taking up of an expression vector by a host cell whether or not any coding 
sequences are in fact expressed. Numerous method of transfection are known to the ordinarily skilled artisan, 20 
for example, CaP04 and electroporatlon. Successful transfection is generally recognized when any indication 
of the operation of this vector occurs within the host cell. However. In the context of the present invention 
successful transfection refers to stable continuous expression of a desired heterologous protein by a host 
culture over numerous generations. 

Choosing of the host production cell is achieved by screening for transient expression and then unampllfed 25 
expression using the method of the instant invention. Vectors were screened for transient expression to 
determine which vectors could be used to express a desired heterologous protein. Transient expression 
provides an indication of whether the particular plasmid that has been taken up functions, i.e.. Is transcribed 
and translated to produce the desired protein. During this time the plasmid DNA which has entered the cell Is 
transferred to the nucleus. The DNA is In a nonlntegrated state, free within the nucleus. Transcription of the 30 
plasmid taken up by the cell occurs during this period. Vectors which were identified as capable of producing 
the desired heterologous protein transiently were then used to establish a stable continuous production cell. 
Transient expression refers to a short period (12-72 hrs) following transfection. Following this initial period 
after transfection the plasmid DNA becomes degraded or diluted by cell division. Random integration within 
the cell chromatin occurs. Screening the cells after two to three weeks of unamplified expression is an indicia 35 
of cells which have retained the recombinant DNA leading to a permanent cell line. 

An assay based on immunoperoxidase staining of a transfected cell was developed to assess quickly 
whether a desired heterologous protein had been expressed. (Gorman. CM. et al., Cell 42, 519-522 [1985]). 
Monoclonal antibodies specific for the desired heterologous protein were screened for use in this assay. Host 
cells containing the vector were stained and compared to parental cell line for screening cells which produce a 40 
specific protein. A monoclonal antibody was selected which gave the strongest signal with the least amount of 
background. Transient transfections were perfomied to test vectors for the ability to produce a desired 
protein. Cells (Cos, 293, CHO. BHK, TM4) were transfected using the CaPO* technique. (Graham and van der 
Eb modified by Gorman, CM. et al., Science 221, 551-553 (1983)). We used ten micrograms per milliliter of 
precipitate of the specific protein vector to be tested. The precipitates were left on the cells for 3-4 hours. Cells 46 
were then glycerol shocked for an average of one minute. Thirty-six hours after transfection cells were fixed 
with acetone-methanol (50:50) and washed with phosphate buffer saline (PBS). Staining was perfonned using 
either a monoclonal antibody supernatant undiluted or purified antibody diluted 1 :3000 in PBS containing lOP/b 
fetal calf serum. This first antibody remained on the cells 2 hours. Plates were placed on a slow shaker during 
this time. Cells were washed 5 times over a ten minute period. The second antibody used was rabbit 50 
anti-mouse IgG (Dakopatts). This was diluted in PBS + fetal caif serum at a dilution of 1:150, A two hour 
incubation was followed by another series of washes. To develop the peroxidase reagent orthodiansidine was 
used as a substrate. An ethanol saturated solution of ortho-diansidine was diluted 1 :100 in PBS with 1 :10,000 
dilution of hydrogen peroxide. This substrate was left on the cells for 2 hrs at room temperature or overnight at 
4''C 55 

By this method a wide variety of vectors encoding the desired protein were quickly screened for the ability to 
direct protein expression. 

Coatest Factor Vm was purchased from Helena Laboratories, Beaumont, TX (Cat. No. 5293). The procedure 
used was essentially that provided by the manufacturer for the "end point method" for samples containing less 
than five percent protein. 60 

Production cell lines were established using plasmids of the instant invention which were shown to function 
transiently in a wide variety of cells. Expression vectors were transfected into a number of cell lines. For these 
transfections a total of 10 [ig of DNA/ml precipitate were used. Selection for expression was made possible in 
these cells using a selectable marker as described above. All cells were transfected with modified CaPA4 
technique except BRL cells which were found to be sensitive to calcium. Electroporation was used with these 65 
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cells. Transf cted cells were selected from sultabi host cells as previously d scribed. . 

The protocol used to establish pr ductlon cell lines relied heavily on the staining method described above. 
Two days following transfection. cells were subcuitured into a selection media. Media was titrated for the 
proper amount of the particular substance ne ded for selection. At the same time that c lis were transfected 
5 to establish a production cell line, a dish of each cell type was assayed for transient expression. 

In order to simplify the following examples certain frequently occurring methods and/or terms will be 
described. 

■Plasmids" are designated by a lower case p preceded and/or followed by capital letters and/or numbers. 
The starting plasmids herein are either commercially available, publicly available on an unrestricted basis, or 

10 can be constructed from available plasmids in accord with published procedures. In addition, equivalent 
plasmids to those described are known in the art and will be apparent to the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction enzyme that acts only at certain 
sequences, restriction sites, in the DNA. The various restriction enzymes used herein are commercially 
available and their reaction conditions, cofactors and other requirements were used as would be known to the 

15 ordinarily skilled artisan. For analytical purposes, typically 1 ^ig of plasmid or DNA fragment Is used with 
about 2 units of enzyme in about 2 p,l of buffer solution. For the purpose of isolating DNA fragments for plasmid 
construction, typically 5 to lOjig of DNA would be digested with 20 to 40 units of enzyme in a larger volume. 
Appropriate buffers and substrate amounts for particular restriction enzymes are specified by the 
manufacturer. Incubation times of about one hour at 37**C are ordinarily used, but may vary In accordance with 

20 the supplier's instructions. After digestion the reaction was run directly on a gel to isolate the desired 
fragment. 

"Dephosphorylation" refers to the removal of the terminal 5' phosphates by treatment with bacterial alkaline 
phosphatase (BAP). This procedure prevents the two restriction cleaved ends of a DNA fragment from 
"circularizing" or forming a closed loop that would impede insertion of another DNA fragment at the restriction 

2S site. Procedures and reagents for dephosphorylatlon are conventional. Maniatis, T. et al., 1982, Molecular 
Cloning pp. 133-134. Reactions using BAP are carried out in 50mM Tris atiSB^C to suppress the activity of any 
exonucleases which may be present in the enzyme preparations. Reactions were run for one hour. Following 
the reaction the DNA fragment is gel purified. 
"Oligonucleotides' refers to short length single or double stranded polydeoxynucleotides which are 

30 chemically synthesized by known methods and then purified on polyacrylamide gels. 

"Ligation" refers to the process of forming phosphodiester bonds between two double stranded nucleic 
acid fragments (Maniatis, T. et al.. Id., p. 146). Unless otherwise provided, ligation may be accomplished using 
known buffers and conditions with 10 units of T4 DNA ligase ("ligase") per 0.6 ^g of approximately equlmolar 
amounts of DNA fragments to be ligated. 

35 "Filling" or "blunting" refers to the procedures by which the single stranded end in the cohesive terminus of 
a restriction enzyme-cleaved nucleic acid is converted to a double strand. This eliminates the cohesive 
terminus and fomris a blunt end. This process Is a versatile tool for converting a restriction cut end that may be 
cohesive with the ends created by only one or a few other restriction enzymes into a terminus compatible with 
any blunt-cutting restriction endonuclease or other filled cohesive tenninus. Typically, blunting is 

40 accomplished by incubating 2-15fjLg of the target DNA in lOmM MgCla. ImM dithiothreltol. 50mM NaCi, lOmM 
Tris (pH 7.5) buffer at about 37° C in the presence of 8 units of the Klenow fragment of DNA polymerase I and 
250 p.M of each of the four deoxynucleoside triphosphates. The incubation generally is terminated after 30 min. 
phenol and chloroform extraction and ethanol precipitation. 

"Northern" blotting is a method by which the presence of a cellular mRNA is confirmed by hybridization to a 

45 known, labelled oligonucleotide or DNA fragment. For the purposes herein, unless otherwise provided. 
Northern analysis shall mean electrophoretic separation of the mRNA on 1 percent agarose in the presence of 
a denaturant (formaldehyde 70/o), transfer to nitrocellulose hybridization to the labelled fragment as described 
by Maniatis. T. et al., Id., p. 202. 
The following examples merely illustrate the best mode now known for practicing the invention, but should 

50 not be construed to limit the invention. AH literature citations herein are expressly incorporated by reference. 

Example 1 
Expression Vector 

55 

Factor Vll l 

1. Construction of Expression Vectors 
The cDNA encoding human factor Vlll was used in the construction of plasmids which would direct the i 
60 expression of factor VIII protein in transfected mammalian cells (Wood, W. et al., Nature [Lond.] 312:330-337 / 
(1984]). Those transformed mammalian cells secreted approximately .14 mU/ml of factor Vlll. The instant 
method provides continuous production of factor Vlll with yields significantly greater. 
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a) pFSCIS 

The V ctor pFSCIS containing the cytomegalovirus enhanc r (B shart, M. et al.. Cell 41, 520 [1985]) and 
promoter (Thomsen, D.R. et al., PNAS 81 . 659-663 [1984]), the cytomegalovirus splice donor site and a portion 
of an intron (Sternberg. R.M. et al. T. of Virol.49. 190-199 [1984]), the Ig variable region intron and splice 
acceptor site, the cDNA encoding factor VIII and the SV40 polyadenylation site was constructed. 5 

Figure 1 shows the steps for construction of the factor VIII expression vector used to establish production 
cell lines for factor VIII. The three parts of the construction are detailed below. 

1 ) The ampicillin resistance marker and replication origin of the final vector was derived from the starting 
plasmid pUC13pML a variant of the plasmid pML (Lusky, M. and Botchen, M., Nature 293, 79 [1981]). 
pUC13pML was constructed by transferring the polylinker of pUC13 (Veira. J. and Messing, J., Gene io 
19:259(1982)) to the EcoRI and Hind II I sites of pML. A second starting plasmid pUCSCMV was the source of 

the CMV enhancer, promoter and splice donor sequence. pUCSCMV was constructed by inserting 
nucieotideis 1 through 732, shown in Figure 17, for the CMV enhancer, promoter and splice donor sequence 
into the blunted PstI and Sph I sit«s of pUC8. Veira, J. and Messing. J. supra . Synthetic Bam HI-Hindlll linkers 
(commercially available from New England Blolabs) were llgated to the cohesive Bam HI end creating a Hindlll 15 
site. Following this ligation a Hindlll-Hincll digest was performed. This digest yielded a fragment of 
approximately BOObp which contained the CMV enhancer, promoter and splice donor site. Following gel 
isolation this 800bp fragment was iigated to a 2900bp piece of pUC13pML The fragment required for the 
construction of pFSCIS was obtained by digestion of the above intermediate plasmid with Sail and Hindlll. This 
3123bp piece contained the resistance marker for ampicillin, the origin of replication from pUC13pML and the 20 
control sequences for the CMV Including the enhancer, promoter and splice donor site. 

2) The Ig variable region intron and splice acceptor sequence was constructed using a synthetic oligomer as 
shown in the central portion of Figure 1. A 99 mer and a 30 mer were chemically synthesized having the 
following sequence for the IgG Intron and splice acceptor site (Bothwell et al., 1981): 

25 

1 5 ' AGTACCAAGCTTGACGTGTGGCAGGCTTGA . . . 
31 GATCTGGCCATACACTTGAGTGACAATGA . . • 

60 CATCCACTTTGCCTTTCTCTCCACAGGT. . . 

88 GTGCACTCCCAG^' 

1 S'CAGGTGAGGGTGCAGCTTGACGTCGTCGGA^' 



DNA polymerase I (Klenow fragment) filled in the synthetic piece and created a double stranded fragment. 
Warteli. R.M. and W.S. Reznikoff. Gene 9. 307 (1980). This was followed by a double digest of PstI and Hindlll. 
This synthetic linker was cloned into pUC13 (Veira, J. and Messing, J., Gene 19, 259 [1982]) at the PstI and 
Hindlll sites. The clone containing the synthetic oligonucleotide, labelled pUCIg.10, was digested with PstI. A 
Cla l site was added to this fragment by use of a Pstl-Clal linker. Following digestion with Hindlll a 118bp piece 
containing part of the Ig intron and the Ig variable region splice acceptor was gel isolated, 

3) The third part of the construction scheme replaced the hepatitis surface antigen 3' end with the 
polyadenylation site and transcription tenmination site of the eariy region of SV40. A vector, pUC.SV40 
containing the SV40 sequences was inserted into pUC8 at the Bam HI site described in VIera, J. and Messing, 
J., supra . pUC.SV40 was then digested with EcoRI and Hpa l. A 143bp fragment containing only the SV40 
polyadenylation site was gel isolated from this digest. Two additional fragments were gel isolated following 
digestion of pSVE.8c1D. European Patent Publication No. 150,457. The 4.8 kb fragment generated by Eco RI 
and Clal digest contains the SV40-DHFR transcription unit, the origin of replication of pML and the ampicillin 
resistance marker The 7.5 kb fragment produced following digestion with Clal and Hpa l contains the cDNA for 
factor VIII. A three-part ligation yields pSVe.8c24D. This intermediate plasmid was digested by Clal and Sail to 
give a 961 1bp fragment containing the cDNA for factor VIII with an SV40 polyadenylation and transcription 
termination sites followed by the SV40 DHFR transcription unit. 

The final three part ligation to yield pF8CIS used: a) the 3123bp Sail Hindlll fragment containing origin of 
replication, the ampicillin resistance marker and the CMV enhancer, promoter and splice donor; b) The 118bp 
Hindlll-Clal fragment containing the Ig intron and splice acceptor; and, c) a 961 1bp Clal -Sail fragment 
containing the cDNA for factor VIII, SV40 polyadenylation site and the SV40 DHFR transcription unit. A portion 
of the sequence of the expression vector pF8CIS is shown in Figure 17. 

b) pFSCSSS 

The vector pFSCSSS containing the cytomegalovirus enhancer and promoter, an engineered stabilizing 
sequence. the cDNA encoding factor VIII and the SV40 polyadenylation site was constructed. The entire intron 
region including donor and acceptor sequences was deleted and replaced by an engineered stabilizing 
sequence. The stabilizing sequence is a synthetic double stranded oligomer having a sequence of the mature 
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mRN A following splicing. The st8±»nizing sequence was inserted between th unique SacH-Clal sites of pF8CIS. 
The sequences of the synthetic oligom rs are as follows: 

£fi£ll 

5 ' GGCCGGGAACGGrGATTGGAACGCG 
3 • CGCCGGCCCTTGCCACTAAGCTTGCGC 

5 ' GATTCCCCGTGCCAAGAGTGACGGTGT 
CTAAGGGGCACGGTTCTCACTGCGACA 

5'CCACTCCCAC GTCCAACTGC 
CGTGAGGGTG GAGGTTGACG 

5 ' AGCTCCGGTTCGAAT3 ' 
TCGAGGCCAAGCTTAGGS ' 
Cla l 



The synthetic oligomers comprise the appropriate nucleotides of the donor and acceptor consensus splice 
sequences. The juxtaposition of the splice donor sequence to the splice acceptor sequence is indicated by the 
underline. This vector resembles the pF8CIS vector discussed above except for the deletion of the intron 
portion and replacement with an engineered stabilizing sequence. This construction eliminates the actual 
splicing of the noncoding region from recently the transcribed mRNA. A portion of the sequence of the 
expression vector pF8CSSS containing the engineered stabilizing sequence is shown in Figure 19. 

c) pFBSCIS 

The vector pFBSCIS containing the SV40 enhancer and promoter, the cytomegalovirus splice donor site and 
a portion of the intron, the Ig intron and splice acceptor site, the cDNA encoding factor VIII and the SV40 
polyadenylation and transcription termination sites were constructed. 

Figure 2 shows the construction of pFBSCIS. 

This vector was constructed using a three part ligation. The preparation of each of the three fragment of 
DNA used in this ligation is described below: 

The first fragment contained the SV40 early region promoter and enhancer and one half the ampicillin 
resistance mart<er which was obtained from plasmid pML. The starting plasmid for the first of three fragments 
was pAML3P.8CL European Patent Publication No. 160,457. This plasmid was cut with Sacl. Using the whole 
enzyme DNA polymerase I this 3' overhang created by Sacl was blunted. Following this reaction the plasmid 
was cut with Pvu l. The desired 434bp fragment was isolated from an acrylamide gel. 

The second and third fragments used in this construction were isolated from the plasmid pFBClS which is 
described above. 

Fragment 2 contained the splice donor from CMV immediate early gene and part of the following intron and 
the intron and splice acceptor synthetically made as described above. pF8CIS vyas cut with Sacll and the 
resulting 3' overhang was blunted by the use of DNA polymerase I. This reaction was followed by cleavage with 
Cla l. Since the sequence surrounding the Clal site in pFSCIS prevents cleavage if the plasmid is grown in a 
methylation plus strain, pFSCIS was prepared from dam- strain GM48. Marinus, M.G. and Maris. N.R.. 
Bacteriol. lU, 1143-1150 (1973) and Geier. G.E. and Madrid, P.. J. Biol. Chem. 254. 1408-1413 (1979). Since 
both Sac ll and Clal are unique sites in this vector the 231 bp fragment was easily isolated from an agarose gel. 

The third fragment contains the cDNA for factor VIII, SV40 eariy region polyadenylation site, a SV40-DHFR 
transcriptional unit, the origin of replication of pML and half of the ampicillin gene. The 1 1308bp fragment was 
prepared by digestion of pF8CIS (dam-) with Clal and Pvul. 

The three part ligation creating pFSSCIS destroys the Sacl and Sacll sites, maintains the Clal site and 
reconstructs the amprgene at the Pvu l site. A portion of the nucleotide sequence of the expression vector 
pFBSCIS is shown in Figure 18. 

Example 2 

Analysis of Expression 
1. Transient Expression 

Factor VIII expression was assayed based on immunoperoxidase staining of transfected cells. Gorman et al. 
Cell 42, 519-526 (1985). This assay was used to test vectors for the expression of factor VIII. Tvvelve 
monoclonal antibodies specific for factor VIII were screened for use in this assay. BHK 31A3B cells (European 
Patent Publication No. 160.457) were stained and compared with parental BHK line to screen cells which 
produce factor VIII. Monoclonal antibody BH6 was found to give the strongest signal with the least amount of 
background. Transfections were performed and transient expression of factor VIII was assessed. Cells (Cos. 
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293. CH0,BHK.TM4) wer transf cted using the CaPO* technique. Ten micrograms per mill lifter of fact rVIII 
V ctor precipitate was t sted. The precipitates were left on the cells for 3-4 hours. Cells were then glycerol 
shocked for an average of 1 minute. Thirty-six hours after transf ectlon cells were fixed with acetone-methanol 
(50:50) and washed with phosphate buffer saline (PBS). Cells were stained using either BH6 supernatant 
undiluted or purified BH6 antibody diluted 1 :3000 in PBS containing IQO/o fetal calf serum. This first antibody 5 
remained on the cells for 2 hours. Plates were placed on a slow shaker during this time. Cells were washed 5 
times over a ten minute period. A second antibody of rabbit anti-mouse IgG (Dakopatts) was diluted in PBS -f- 
fetal calf serum at a dilution of 1:150. A two hour incubation was followed by another series of washes. 
Ortho-diansidine (Sigma) was used as a substrate for developing the peroxidase reagent. A ethanol saturated 
solution of ortho-diansidine was diluted 1:100 in PBS with 1:10.000 dilution of hydrogen peroxide. This 10 
substrate was left on the cells for 2 hrs at room temperature or overnight at 4*0. 

This method provided a screen for those factor VIII vectors directing factor VIII expression. This method 
determines transient factor VIII expression. Staining thirty-six hours after transfection provides an indication of 
whether thn vector w«s transcribed and the mRNA translated. 

pF8CIS directed transient expression of factor VIII in at least five differerrt cell lines: COS, 293, CHO. TM4 15 
and BHK. Figure 3A shows transient expression of the vector pFSClS In CHO cells. 

pFBSCIS was found to direct transient expression of factor Vlll as efficiently as pFSClS. Figure 3B shows 
transient expression of the vector pFSSCIS in CHO cells. Since the CMV enhancer and promoter can be 
completely replaced by the analogous SV40 enhancer and promoter, factor Vlll production is not dependent 
on the specific transcriptional start signal but rather is dependent on other parts of the control region such as 20 
the stabilizing sequence site in the vector. 

At the same time that cells were transfected to establish a production cell line, a dish of each cell type was 
assayed for transient expression. Results of the transient expression screen for factor VIII produced two 
classes of cells: those ceil types which stained positively for factor Vlll thirty-six hours after transfection 
(Category 1 ) ; and, those cell types having no detectable transient expression of factor Vlll (Category 2). The 25 
host cells comprising each category are indicated below: 



Cateporv 1 






CHO 


HDCK 


30 


293 


BRL 




BHK 


Hela 






Vero 


35 




W138 




COS 


CVl 




HepG2 




40 


TRl 







As discussed above deletion of the Ig variable region intron and donor and acceptor sites, while maintaining 
the other control regions, resulted in elimination of transient expression of factor Vlll, From this data at least 
one splice donor-intron-acceptor sequence appears to be required for expression. 

Additional experiments indicate that location of the stabilizing sequence is important. For example location 
of an intron 3' to the cDNA encoding factor Vlll failed to express factor Vlll. Vectors which were constructed to 
include native factor Vlll splice sites, i.e. splice sites within the coding region, also proved unsuccessful. The 
splice donor-acceptor arrangement containing the CMV splice donor sequence and a chimeric intron ^ 
comprising CMV sequences and the synthesized Ig variable region intron and acceptor is an example of a 
stabilizing sequence which will lead to the establishment of a cell line providing continuous production of 
factor Vlll. 

2. Continuous Production 

Production cell lines were established by transfecting the plasmids. containing a stabilizing sequence and 
shown to function transiently in a wide variety of cells, into a number of cell lines. For these transfections a total 
of 10 jig of DNA/ml precipitate was used. For transfection of CHO DHFR cells 4 \iq of factor Vlll plasmid was 
added to 6 ^g salmon sperm DNA. which served as a carrier. Wigler. M. et al.. supra . Direct selection for 
expression of DHFR gene was possible in these cells. All other cell types required cotransfection with a ^ 
plasmid expressing neomycin gene. Davies. J. and Jenning, A., Am. J. Trop. Med. Hyg. 29(5), 1089-92) (1980). 
Either pSVENeoBa16 (European Patent Publication No. 160,457) or pRSVneo (Gorman, et al. Science, 221, 
551-553 [1983]) were used. For these transfections 4ng factor Vlll plasmid 1 jig of neomycin containing 
plasmid and 5 ^ig salmon sperm carrier were used. All cells were transfected using modified CaPO^ technique 
except for BRL as discussed above. Transfected cells included: BHK. CHO-DHFR, CV1. Vero. WI38. 293. TM4. 
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Hela, MDCK. HTC, BRL, TR1 and HepG2. 

The protocol used to establish production lines relied heavily on the staining method described above. TWo 
days following transfection cells were subcultured into selection m dia lacking glycine, hypoxanthln and 
thymidine for the CHO DHFR cells or G418 containing media. Levels of G418 were titrated for the proper 

5 amount needed for selection. 

Three to four weeks following the onset of selection, cells were screened for stable expression of factor VIIL 
A dish of clones was stained to determine the percentage of clones expressing factor VIII. Following this 
determination twelve clones of each cell type were picked for staining. Clones which scoried positive at this 
time indicating stable expression were then also assayed quantitatively by Coatest assay. 

10 Those cells which failed to demonstrate transient expression did not demonstrate stable expression at this 
time e.g. Vero, HeLa. Stable expression of factor VIII was observed in two catagories of cells: 1) some cells 
which expressed factor Vlll transiently scored negative during this first round of stable expression e.g. CHO 
and BHK cells failed to show stable expression levels high enough to stain; and 2) ceil lines which stained 
positively at both transient and unamptified stages e.g. TM4, HTC and 293 which are then referred to as 

15 potential production cell lines. 



25 





TM4 


CHO 


HTC 


BHK 


293 


Vero 




Hela 




CVl 




WI38 




BRL 



Low levels of factor Vlll in CHO cells were assayed by coatest. Results of the coatest assay were as follows : 
Factor VIII Levels in Unaniplif ted Cells 
Host Cells nU/lO* cells/day 



40 


TM4 


1.8 




HTC 


0.5 




293. 


2.5 


45 


CHO 


<0.15 




BHK 


<0.15 



The results of the foregoing assay demonstrate varying levels of factor Vlll production within the class of 
production host cells. The results of immunoperoxidase staining of transfected cells for transient expression 
at 36-48 hours followed by staining of unampllf ied cells three to four weeks thereafter was predictive for using 
a particular cell type as a production ceil for factor Vlll. If the cells did not stain positively indicating an absence 
of factor Vlli expression at the transient and unamplified levels that host cell is unlikely to serve as a production 
cell line for factor Vlll. This conclusion is supported by the results of staining CHO, BHK. HTC. TM4 and 293 
cells for factor Vlll expression after rounds of amplification. 

Following the first round of amplification (MTX:100nM) clones were again isolated from the foregoing 
transforming cell types and analyzed for production of factor VIII. Results of factor Vlll expression after this 
first round of amplification were as follows: 
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Factor VIII levels after One Round of Arapltf IcaMon 

4 

Host Cells inU/10 cells/day 



TM4 3.0 

HTC 2,3 

CHO 0.1 inU/ml 

BHK 0.1 ttU/ml 



1st Round 

C^ll Typg VgCt9r VnfilPPllfl?*^ Amplification 

TAU pFBCIS 18% 77% 

pFSSClS 15% 90% 

CHO pFSCIS 0 0.1% 

pF8SCIS 0 0.1% 



10 



Both clones and mass populations were kept for further amplification. Though factor Vlil was monitored 75 
transiently in both CHO and BHK cells, few clones were Identified after one round of MTX amplification and 
upon continued passage of cells the low levels of factor VIII were lost. Heterogeneity was seen with these 
clones. Fig. 4. CHO cells which make factor VIII had a greatly increased doubling time of 45-52 hrs and were 
overgrown by non-expressing cells in the population which have a doubling time of 28 hrs. Careful study 
demonstrated that continuous CHO clones expressing factor VIII were difficult to establish. The data 20 
presented below shows the frequency of clones expressing factor VIII In DHFR positive CHO clones at both 
the unamplified level and after one round of amplification. TM4 cells are shown for comparison. The number of 
clones which stained positive for factor VIII is given as a percentage of the number of stable clones obtained 
following transfectlon. 



25 



30 



35 



At the second or third round of amplification, usually approximately 1 jiM methotrexate, factor VIII was 
detectable In CHO cells. Even at this high level of amplification, activity of factor VIII was low, 63 mil/ml. 
Continued amplification did not lead to increased production in CHO cell lines. Morphological analysis of three 40 
separately derived CHO lines show the cells staining for factor Vill to be enlarged In size and flattened. Fig. 5. 
Transformed CHO cells amplified to 10 nM produced no more than 200 mU/ml. These results Indicate that the 
choice of a host cell is an important step in the establishment of a production cell system for factor VIII. 
Presently TM4, HTC and 293 have been used to establish permanent cell lines providing continuous 
production of factor Vlll, thus qualifying as production cell lines. 45 

Example 3 

Coagulant Activity of Factor Vlll 

The expression and secretion of active factor Vill from TM4 cells was determined by coagulation analysis. 50 
Serum free media that had been conditioned for 48 hours by TM4 cells transfected with pFSCIS was assayed 
for factor Vlll. As shown in Table 1 TM4 culture media shortened the clotting time of hemophilic plasma. Most 
of this coagulant activity was neutralized when TM4 media was preincubated with a polyclonal antibody against 
human factor Vlll. 
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Table 1 

Sampl Clot Time Units/ml 





pd VIII 




A6.5 


0.5 


10 


pd VIII + 


factor VIII 
antibody 


91.5 


<0.01 




TH4 media 




54.2 


0.36 


15 


TH4 media 


+ factor VIII 
antibody 


70.4 


O.OS 



20 

Table 1. Secretion of active factor VIII from TM4 cells. TM4 media was either prelncubated with 10 of a 
human factor VIII polyclonal antibody or with no addition for 30 minutes at 37**C. Subsequently the media was 
diluted 1 :1 with 0.05 M Tris pH 7.5 containing 0.01 o/o BSA and assayed by coagulation analysis. Purified plasma 
derived factor VIII (pd VIII) was treated similarly. 

Example 4 

Expression Vector 

^ Variant Factor VIII 

One approach to achieve a more efficient protein Is protein engineering. That Is, by introducing changes 
within the gene at the DNA level, variants can be produced in ceil culture to allow for specific modification In 
protein function. Three variants were engineered. The native factor VIII single chain 300.000 dalton protein is 
cleaved to subunits of 90.000 and 80,000 dalton which in turn are cleaved to the active subunlts of 50.000, 
^ 43,000 and 73,000 dalton. The B domain between amino acid 742 through 1648 has no defined function. Vehar. 
G.A. et aJ.. Nature 312. 330-337 (1984). The same cell systems described for expression of the full length 
recombinant factor VIII protein were used to express the mutant. 

PF8CIS9080 

The eukaryotic expression vector used to express the factor VIII fusion protein included: the enhancer 
(Boshart et al., supra) , and promoter (Thomsen et al.. supra ) of the human cytomegalovirus (CMV) immediate 
early gene ; the splice donor sequence located 3' of the transcription initiation site of this gene (Boshart et al., 
supra , Stenberg et al., supra ); and a synthetic splice acceptor site from the mouse immunoglobulin variable 
region {Bothwell~~et al., supra ), the new coding region is flanked on the 3' end by the SV40 early 
polyadenylation sequence and transcription termination site (Piers et al., supra ). The vector includes an 
amplifiable marker, the SV40-DHFR transcription unit. 

Construction of the expression vector. pF8CIS9080. encoding the factor VIII fusion protein 90kd + 142aa + 
80kd is shown in Figure 6. Starting with the plasmid pSVE.ScID (European Patent Publication No. 160,457), a 
short deletion was made in the 3' untranslated region by cutting with Sstll, blunting the cohesive ends with SI, 
further cleaving with Hpa l and religating the two blunt ends to generate pSVE.8c9. This plasmid was cleaved 
with Clai and Sa|l and the 10031 bp fragment cloned in the Sal!, ClaL A 6761 bp promoter containing fragment of 
pAML3P.D22 (European Patent Publication No. 160,457). The fusion in the factor VIII gene was made by 
ligating the filled in Tth111 I and Bam Hl (amino acid 1563) sites within the factor VllI gene. Figure 6 shows 
ligation of a 2516bp fragment of pAML3P.8cI (European Patent Publication No. 160,457) and a 11991bp 
fragment of pAML3P.8c9 to construct pAML3P.8L19 containing the fused region. This fusion was confirmed by 
DNA sequence. A 4722bp Clal-Xbal fragment containing the fusion region was cloned into a 5828bp Clal-Xbal 
fragment of pF8ClS containing the CMV promoter-enhancer expression vector. The CMV fragment was 
obtained from a dam- strain of E. coli where methylation does not prevent cutting at the Cla l site. 



Example 5 



Expression Results 

The method described in Example 2 was applied to expression of the factor Vllt variant which deleted 
nucleotides 796 through 1562, pF8ClS9080. The 90kd + 142aa + 80kd fusion protein is expressed at higher 
levels than the full length protein. However there remains considerable variation between cell types as to the 



14 



0 260 148 



capability of expressing the fusion protein. 

The following data demonstrates that the choice of a proper host cell will provide continuous production of 
the desired fusion protein in commercially useable quantities. TM4 cells transfected with pF8CIS9080 showed 
both transient and stable expression of the fusion protein. TM4 cells transfected with pF8CiS9080 showed a 
five-fold Increase In the levels of the fusion protein as compared to the full length factor Vlii. At lOOnM 5 
methotrexate pooled clones of the fusion factor VIII yielded 12mU/10* cells/day. HTC ceils showed a similar 
enhancement in expression of the fusion factor Vlil as compared to the full length factor VIll. 

Expression of the fusion protein factor Vlil is quite high in 293 ceils as compared to full length factor Vlll 
expression. In 293 cells transformed with the fusion protein vector pF8CIS9080 the unamplified population 
levels of 85 mU/10* cells/day were routinely achieved. Expression levels of full length factor Vlll were lower 10 
than the fusion factor Vlii yielding 2.5 mU/10* cells/day. Since the control signals are identical in the pFSCIS 
and pF8CIS9080, the difference in expression levels must lie within the capability of the cell to produce full 
length message and /or protein. 

The fusion protein was detected at an earlier point of amplification (lOOnM) than the full length (1000 nM). 
however as shown in figure 7 these cells were burdened by the expression of the fusion protein. CHO cells 15 
expressing 90l<d -h 142aa -f- BOkd at 100 nM produced 0.5 nU/10* cells/day. Continued amplification was 
difficult due to mixed population seen in figure 8. Clones selected at 1 |iM MTX and 5 p,M MTX showed no 
higher expression levels than the 0.1 \iM MTX lines. In summary, certain host cells were particularly adept at 
expression of factor Vlll or its variants e.g. TM4. Other host cells were of an intermediate nature in that the 
variant is expressed while the full length factor Vlll Is expressed in low levels or not at all e.g. 293 cells. A final 20 
group of host cells is unlikely to produce sufficient factor Vlil for production, e.g. TRI. 

Example 6 

Purification and Characterization of Fusion Protein 25 

The 90kd -f 1 42aa + 80kd fusion was purified from 293 media using techniques previously described for full 
length recombinant factor Vlll, Eaton, D.E. et ai., Biochemistry 25, 505-512 (1986). The purified fusion had a 
specific activity of 4.000-6,000 unlts/mg. which is comparable to the specific activity of plasma derived factor 
VIII. When subjected to SDS-PAGE the fusion resolved into two major bands with Mr of 115.000 and 80,000. A 
band with a Mr of 180,000 was also seen and probably represents the single chain fonm of the fusion. The Mr 30 
180.000, 115,000. and 80,000 proteins were all detected by a factor Vlll polyclonal antibody in a Western blot 
(Figure 11). 

Coagulantactivity of the90kd + 142aa + 80kd fusion was activated 10-20 fold by thrombin (Figure 12). This 
activation correlated with the generation of subunits with Mr 50,000, 43,000, and 73.000 (Id.). Since factor Vlll 
circulates in plasma bound to von Wiiiebrands factor (vWF). binding of the 90kd + 142aa + 80kd fusion to 35 
vWF was also tested. Purified 90kd + 142aa -f 80kd fusion that was passed over a vWF-Sepharose column 
quantitatively bound to the column (Figure 13). Subsequently the fusion was eluted from the column with 0.25 
M CaCla. which is known to dissociate factor Vlll-vWF complexes. 

The above data show that the 90kd + 142aa + 80kd fusion expressed and secreted from the 293 cells was 
functionally similar to plasma derived factor Vlll. The 90kd + 142aa + 80kd fusion shortened the clotting time 40 
of hemophilic plasma and its activity was neutralized by a factor Vlll antibody. The fusion was activated and 
processed by thrombin similarly to plasma derived factor Vlll. The 90kd + 142aa -i- 80kd fusion also bound to 
vWF immobilized on Sepharaose and was dissociated from vWF under conditions known to dissociate factor ' 
Vlll-vWF complexes. 

45 

Example 7 

Coagulant Activity of Fusion Protein 

Serum free media that had been conditioned for 48 hrs. by 293 cells transfected with pF8CIS9080 was 
assayed for factor Vlll coagulant activity by coagulation analysis. After the media was diluted 1/50 it was 50 
assayed and found to shorten the clotting time of hemophilic plasma from 120 sec. to 58,9 sec, corresponding 
to 5.5 units/ml of factor VIII coagulant activity. This activity was neutralized by a polyclonal antibody against 
plasma derived factor Vlll (Table 2). 
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Table 2 



5 Sample Clot Tine Units/ml* 

(seconds) 





Buffer 


120.0 


<0.1 


10 


Diluted 293 -media 


58.9 


5.5 


15 


Diluted 293-inedia 
Preinciibated with 
factor VIII antibody 


118.0 


<0.1 


rtn 


Undiluted 293 -media 
(parent cell line) 


100.0 


<0.1 



* In undiluted media 



25 Table 2. Coagulant activity in 293 media. 

Hemophilic plasma (50 was Incubated with 60 \i\ of platelin (General Diagnostics) for 8 min. at 37^C. 

Subsequently 50 \i\ of media that had been diluted 1/50 with 0.05 M Tris pH 7.4 buffer containing 0.01 percent 

BSA was added and incubated 30 sec. CaCIa (25 mM). 50 jil, was added and the clot time was measured. 

Media obtained from the parent cell line, which was not transfected, was not diluted. Antibody neutralization 
30 experiments were perfomned by prelncubating undiluted media (100 jtl) with 10 fig of factor VIII polyclonal 

antibody for 30 min. at room temp. The media was then diluted and assayed. 

Example 8 

35 Expression Vector of Factor VIII Variant Resistant to Activated Protein C 

Activated protein C (APC). a plasma protein, has been shown to inactivate human factor Vlli by limited 
proteolysis. One possible site of this inactivation cleavage is at arginine at position 336. The arginine at position ^ 
336 can be changed to another amino acid, for example, lysine or glutamic acid. Two vectors, pF8CIS336E and 
pF8ClS9080-336E, were constructed to determine whether position 336 was a site of inactivation. Using in 

40 vitro mutagenesis (Norris. K. et a[.. Nucleic Acids Research. 11^. 5103-6112 [1983]) the arginine at position 336 
was mutated to a glutamic acid (Fig. 9). For the mutagenesis a 792 bp Hlndlll-Kpni fragment from pFSCIS was 
inserted into the Hindlll-Kpnl sites of m13. The 18 bp oligomer shown below was used to mutagenize this 
fragment. 

45 ■ -m 

P Q L E M K N 

5' CC CAA CTA GAA ATG AAA A 3' 

50 * 

Following strand extension the double stranded mutagenized M13 clone was cut with Acc I and Kpn I. A 778 
bp fragment was gel purified. The plasmid pFSCIS was grown in a dam- strain of E. coll . GM48. Due to the 
sequence of the Pstl-Clal linker shown in figure 1 , the Clal site of pFSCIS will not cut if the plasmid is grown in a 

55 methylation plus strain of bacteria as discussed above. Two fragments were isolated from the dam- pFSCIS 
DNA, a 1 0kb Kpnl partial-Clal fragment and a 11 08 bp Clal-AccI fragment. A three part ligation was required to 
replace the native factor VIII sequence with the mutagenized sequence. See figure 9. 

Construction of pF89080-336E proceeded via another three part ligation as shown in Figure 10. A 11 15 bp 
Spel-Bglll fragment containing the 336E variant amino acid was transferred to create another variant fusion 

^0 protein by ligation to a 891 bp Sacll-Spe! fragment and a 8564bp Bglll-Sacll fragment isolated from 
PF8CIS9080. 

Both of these protein variants were expressed in 293 cells. Full length factor VIII with this mutation was 
expressed at 2.8 mU/IO"* cells/day while the fusion variant was expressed at 15 mU/10* cells/day. ^ 
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Example 9 

Activity of Factor VIII Resistant to Activated Protein C 

Media obtained from 293 cells transfected with pF8CIS-336E shortened the clotting time of hemophilic 
plasma. This activity was neutralized by a factor VIII polyclonal antibody (Table 3). Activated protein C, 5 
however, did not inactivate recombinant factor VIII containing a glutamic acid at position 336 (Table 3). 

Table 3 



10 



Sample 


Clot Time 


Units/ml 




293-336E 


70.4 


0.8 


15 


rVIII 


65.8 


1.1 




293-336E media -i- APC 


68.5 


0.9 


20 


rVIII + APC 


85.0 


0.28 




293-336E media 
Preincubated with 
factor VIII polyclonal 


95.0 


<0.1 


25 



Table 3. Stability of full length factor VIII/336E to Activated Protein C. Senjm free media that had been 
conditioned for 48 hrs by transfected 293 cells producing full length factor VIII/336E (referred to as "293-336E'' 
In the table) was concentrated 27 fold. To 100 jxl allquots of this media. 10 p,! of rabbit brain cephalin and 10.0 
ng of APC was added. Controls received no APC. Samples were incubated for 40 min. at 37" C. Similarly, 
purified recombinant factor VIII containing the arginine at position 336 (rVIII) was diluted to -1 units/ml with 

0. 05 M Tris. pH 7.5. 150 mM NaCi, 2.5 mM CaClz and Incubated with rabbit brain cephalin and APC. Media from 
transfected 293 cells producing full length factor VIII/336E (26X) was also preincubated with a factor VIII 
polyclonal antibody djil of IgG prep) for 40 min at 37° C. At the end of incubations, samples were assayed by 
coagulation analysis. 

Example 10 

Expression Vector Prorelaxin 

1. pCIHRX 

The Vector pCIHRX contained the cytomegalovirus enhancer and pronnoter, the cylomegalovlnjs splice 
donor site, the Ig variable region splice acceptor site, the cDNA encoding H2 preprorelaxin and the hepatitis 
surface antigen polyadenylation and transcription termination sites. Figure 14 shows the steps for 
.construction of the prorelaxin vector. The same intron and splice acceptor sequence described previousty 
from the Ig variable region was maintained. 677bp of the preprorelaxin cDNA followed these 5' processing 
signals. While the 5' control signals were identical to pF8CIS the polyadenylation region and termination 
sequence signals were from the hepatitis surface antigen gene rather than SV40. 

An intermediate plasmid pClaRX was first constructed The plasmid pSVERX (see copending U.S. patent 
application U.S.S.N. 06/907,197. filed September 12, 1986, and corresponding European application) was cut 
with Hindlll to isolate a 1700bp fragment containing the pre-prorelaxin cDNA followed by the hepatitis B 
surface antigen (HBsAg) 3' polyadenylation site. A Kpn l site was 3' to the HBsAg polyadenylation site and 5' to 
the start of the SV40 early promoter which in this vector was used to drive expression of the DHFR cDNA. 

This Hindlll fragment was inserted into pML linearized at the Hindlll site. Reclosures were minimized by 
treatment with bacterial alkaline phosphatase (BAP). Amplcillin resistant colonies were screened to isolate 
clones which had inserted the pre-prorelaxin gene so that the 5' end of the gene was next to the Clal site of 
pML. 

The intermediate plasmid pCIARX was cut with Clal and Kpn l to isolate a 1360bp fragment containing the 
pre-prorelaxin gene followed by the hepatitis surface antigen 3' polyadenylation sequences. This fragment was 
ligated to the 5143bp fragment created by cutting pF8ClS dam- with Clal and Kpn l. 
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2. pCISRX 

Because the choice of polyadenylation s quences is known to influence 5' processing of m ss nger RNA 
(Wilson & Nevins, supra ), the 3' hepatitis polyadenylation sequence in pCIHRX was replaced with the pSV40 
polyad nylation sequence. This construction was designated pCISRX. The two starting vectors for this 

5 construction are pCIHRX and pFSCIS. The latter vector has the same 5' controls as pCIHRX but includes the 
cDNA for factor VIII and the SV40 polyadenylation site. SacI I was used to cleave 3' of the cDN A. The resultant 3' 
overhang was blunted by T4 polymerase. pCIHRX was then cut with Bam HI. This site separates the chimeric 
intron from the 5' end of the relaxin gene. An 861 bp fragment was gei isolated from the BamHI treatment. The 
SV40 polyadenylation site, DHFR, transcription unit, bacterial origin of replication and ampr gene, as well as the 

10 CMV enhancer and promoter and splice donor were isolated from pFBCIS. These elements were isolated in 
two fragments, as a 2525bp Sall-BamHI fragment and a Hpal-Sall 3113 bp fragment. A three part ligation of the 
Bam Hl-Sacll (blunted) fragment with the Hpal- Sail fragment and Sail to Bam HI fragment yields pCISRX. 

Example 11 

15 

Expression Prorelaxin 

The expression capabilities of the two relaxin expression vectors pCIHRX and pCiSRX, were assayed using 
several anti-relaxin antibodies in the immunoperoxidase method described above. Three rabbit polyclonals 
and three mouse monoclonal antibodies were tested on COS cells transfected with pSVERX. One monoclonal 
20 RX-1 was found to give intense staining with no background. 

The two vectors of this invention, pCIHRX and pCISRX, were tested for prorelaxin expression and compared 
to pSVERX. pClHRX and pClSRX vectors differed in the polyadenylation sequence. pCIHRX contained the 
hepatitis surface antigen polyadenylation sequence while pCISRX contained the SV40 eariy region 
polyadenylation sequence. 

25 293. TM4 and CHO cells were transfected with 10 ^§ total DNA which included 1 iig pRSVneo. 5 jjig salmon 
spenm carrier and 4 p,g of plasmids pSVERX. pCIHRX and pCISRX. Cells were glycerol shocked as described 
above. Thirty-six hours following transfection cells were fixed and stained with IH6 to identify transformed cells 
making prorelaxin. Positive staining cells were seen in 293 and TM4 cells transfected with pCIHRX and 
pCISRX. Duplicate plates of CHO. 293 and TM4 cells were split and subjected to the staining protocol 

30 described above to screen for prorelaxin production cells. 

Expression results are shown in the tables below indicating that the vectors containing the stabilizing 
sequence 5' of the DNA encoding prorelaxin produced significantly higher levels of prorelaxin than the 
reference plasmid, pSVERX. In the case of stable expression the media assay for prorelaxin was from the 
general population of cells. 
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Transient ^r^^^inn Prorelaitin 



Cell Typg 

CHO 



1M4 



pSVERX 
pCIHRX 
pCISRX 

pSVERX 
pCIHRX 
pCISRX 



Amount of Protein /^Tlgi^ml) 

0.9 
3 

0.4 
2 
10 



10 



15 



293 



pSVERX 
pCIHRX 
pCISRX 



0.4 
3 
12 



20 



Stable Expression Prorelaxtn 



g^U Type 

CHO 



Plasmid 
pSVERX 
pCIHRX 
pCISRX 



Amount of Proteii. ^^fr/yp] ) 
0.6 
0.8 
3.9 



25 



30 



35 



293 



pSVERX 
pCIHRX 
pCISRX 



0.41 
3.0 
22.0 



Example 12 

Expression Vector t-PA 
1. pCIHt-PA 

The vector pCIHt-PA containing the cytomegalovirus enhancer and promoter, the cytomegalovirus splice 
donor site and intron. the Ig variable region splice acceptor site, the cDNA encoding t-PA (Pennica et al., 
Nature 301 , 214 (1983)) and the hepatitis surface antigen polyadenylation and transcription terminatiorTsite 
was constructed. 

Figure 16 shows the steps for construction of the t-PA vector. 

The t-PA cDNA was first cloned into pML to provide a Clal site at the 5' end of the gene. To do this a 3238 bp 
Hindill fragment from pSVpa-DHFR (othenwise referred to as pETPFR in UK patent 2,119,804 B) was inserted 
into the Hindill site of pML Colonies were screened for clones which have the 5' end of the cDNA juxtaposed 
to the Clal site. The intermediate plasmid labelled pCLAt-PA is shown in Figure 16. A t-PA cDNA followed by 
the 3' polyadenylation region was isolated as a Clal-Kpnl fragment of 2870bp. This fragment was ligated to the 
5146bp fragment of pF8CIS. This Clal-Kpnl fragment of the CIS vector provided the 5' control region, a 
SV40-DHFR transcriptional unit, the ampicillin resistance gene and origin region from pML pCIHt-PA is 
analogous to pCIHRX. discussed above, with the exception of the cDNA coding for the desired heterologous 
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Expression levels of t-PA were compared by transfecting CHO and 293 cells with pSVpaDHFR, pCMVt-PA 
and pCIHt-PA The former two vectors did not contain a stabilizing sequence and thus served as controls for 
the vector pCIHt-PA containing the cDNA encoding t-PA constructed in accord with the instant invention. 
5 Media from each of the cultured transformed 293 cells were assayed and the following results were obtained: 
pSVpaDHFR gave 30 ng/mi ; pCMVt-PA gave 200 ng/ml of t-PA; and pCIHt-PA gave 420 ng/ml of t-PA. 

2. pCiSt-PA , . 

The vector pCiSt-PA containing the cytomegalovirus enhancer and promoter, the cytomegalovirus splice 
10 donor site and intron. the Ig variable region splice acceptor site, the cDNA encoding t-PA and the pSV40 

poiyadenylation sequence was constructed. 
The starting vectors for this construction are pCIHt-PA and pFSCiS (see Figure 20) . The latter vector has the 

same 5' controls as pCIHt-PA but includes the cDNA for factor VIII and the SV40 poiyadenylation site. Sacli 

was used to cleave 3' of the t-PA cDNA. The resultant 3' over hang was blunted by T4 polymerase. pCIHt-PA 
15 was then cut with Clal. This site separates the chimeric intron from the 5' end of the t-PA gene. A 2870bp 

fragment was gel isolated from the Clal treatment. The SV40 poiyadenylation site. DHFR. transcription control. 

bacterial origin of replication and ampr gene, as well as the CMV enhancer and promoter and splice donor were 

Isolated from pFSCIS. These elements were isolated into fragments as a 2525bp Sall-Clal fragment and a 

Hpal-Sall 31 1 3 fragment. A three part ligation of the SaclKbluntj-Oal fragment with the Hpal-Sall fragment and 
20 Sall-Clal fragment yields pCISt-PA. 

"Expression levels of t-PA were compared by transfecting 293 and CHO cells with pCIHt-PA and pCISt-PA. 

Media from each of the cultured transformed cells were assayed and the following results were obtained: 



25 



Transient 
(t-PA ng/ml) 



CHO 


CIS 


55 


30 


CIH 


15 


293 


CIS 


3000 




CIH 


1300 



35 



40 

Claims 

1. A method for continuous production of a desired heterologous protein in a eukaryotic host cell 

comprising: . , 

^ a) constructing an expression vector having a sequence of double stranded DNA compnsing the 

following elements: 

1) a stabilizing sequence downstream of a promoter and upstream of a DNA encoding the amino acid 
sequence of the desired heterologous protein; 

2) DNA encoding the amino acid sequence of the desired heterologous protein downstream of said 
stabilizing sequence; and, ... 

3) DNA coding a poiyadenylation sequence downstream of which is a transcription termination site; 

b) transfecting and then choosing a eukaryotic host cell with said expression vector; and 

c) culturing the transfected eukaryotic host cell under conditions favourable for continuous 
production of the desired protein. 

2. The method of claim 1 wherein the promoter is from the immediate early gene of human 
cytomegalovirus. 

3. The method of claim 1 wherein the promoter is from simian virus 40 (SV40) . 

4. The method of any one of claims 1 , 2 and 3 wherein the stabilizing sequence comprises at least one 
but not more than two splice donor-intron-acceptor sequences. 

QQ 5. The method of claim 4 wherein the splice donor sequence of the splice donor-intron-acceptor 

sequence is from the immediate eariy gene of human cytomegalovirus. 

6. The method of claim 4 wherein the intron of the splice donor-intron-acceptor sequence is from the 
human cytomegalovirus and the immunoglobulin variable region. 

7. The method of claim 4 wherein the splice acceptor sequence of the splice donor-intron-acceptor 
55 sequence corresponds to the immunoglobulin acceptor sequence. 
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8. The method of any one of claims 1, 2 and 3 wherein the stabilizing sequence comprises an 
engineered DNA coding a mRNA ha\^ng the sam prop rtles as a mRNA which had been subject to 
spiicing but from which no nucleotide sequence had in fact been removed. 

9. The method of any on of the pr ceding claims wherein the DNA encoding the amino acid sequence 

of a heterologous protein encodes factor Vlli. s 

10. The method of any one of claims 1 to 8 wherein the DNA encoding the amino acid sequence of a 
heterologous protein encodes t-PA. 

11. The method of any one of claims 1 to 8 wherein the DNA encoding the amino acid sequence of a 
heterologous protein encodes proreiaxin. 

12. The method of any one of claims 1 to 8 wherein the DNA encoding the amino acid sequence of a 10 
heterologous protein encodes a variant of factor VIII. 

1 3. The method of claim 1 2 wherein the factor Vlil is resistant to cleavage by activated protein C. 

14. The method of any one of the preceding claims wherein the DNA coding the polyadenyiation 
sequence is from simian virus 40 (SV40). 

1 5. The method of any one of the preceding claims wherein the host cell is mouse Sertoli cell (TM4). is 

16. The method of any one of claims 1 to 14 wherein the host ceil is hepatoma cell (HTC). 

17. The methodofanyoneof claims 1 to 14 wherein the host ceil is human embryonic kidney cell (293). 

18. The method of any one of the preceding claims wherein the expression vector includes an enhancer. 

19. The method of claim 18 wherein the enhancer is located upstream of the promoter. 

20. The method of claim 18 or claim 19 which includes an enhancer and promoter from simian virus 20 
(SV40). 

21. A vector suitable for continuous expression in a eukaryotic host ceil culture of a desired 
heterologous protein which vector has the features defined in any of the preceding claims. 

22. TM4 cells, HTC ceils or 293 cells transformed with an expression vector having the features defined in 

any one of claims 1 to 20. 25 
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alul 
sad 
hgiAI 
bspl286 
banll 

taqi spel 
1 TTCGAGCTCG CCCGACATTG ATTATTGACT AGTTATTAAT AGTAATCAAT TACGGGGTCA 
AAGCTCGAGC GGGCTGTAAC TAATAACTGA TCAATAATTA TCATTAGTTA ATGCCCCAGT 
from pPMLCMV beginning to Hindll I, enhancers and promoter 

scrFI 
bgll bstNI 
sau96I 

thai haelll 
61' TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC TTACGGTAAA TGGCCCGCCT 
AATCAAGTAT CGGGTATATA CCTCAAGGCG CAATGTATTG AATGCCATTT ACCGGGCGGA 

ahall 
aatll 

121 GGCTGACCGC CCAACGACCC CCGCCCATTG ACGOXZAATAA OTGACGTATGT TCCCATAGTA 
CCGACTGGCG GGTTGCTGGG GGCGGGTAAC a?GCAGTTATT ACTGCATACA AGGGTATCAT 

ahall 

aatll bgll 
181 ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT ATTTACGGTA AACTGCCCAC 
TGCGGTTATC CCTGAAAGGT AACTGCAGTT ACCCACCTCA TAAATGCCAT TTGACGGGTG 

ahall 

rsal ndel rsal aatll 

241 TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT 
AACCGTCATG TAGTTCACAT AGTATACGGT TCATGCGGGG GATAACTGCA GTTACTGCCA 

bgll 

sau96I scrFI nlalll 

haelll bstNI rsal rsal 

301 AAATGGCCCG CCTGGCATTA TGCCCAGTAC Aax;ACCTTAT GGGACTTTCC TACTTGGCAG 
TTTACCGGGC GGACCGTAAT ACGGGTCATG TACTGGAATA CCCTGAAAGG ATCAACCGTC 

nialll 
styl sfaNI 

snaBI ncol hphi rsal 

361 TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA GTACATCAAT 
ATGTAGATGC ATAATCAGTA GCGATAATGG TACCACTACG CCAAAACCGT CATGTAGTTA 

ahall 

hinfl aatll 
421 GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT TGACGTCAAT 
CCCGCACCTA TCGCCAAACT GAGTGCCCCT AAAGGTTCAG AGGTGGGGTA ACTGCAGTTA 



nlalV 
bani 

481 GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA AATGTCGTAA CAACTCCGCC 
CCCTCAAACA AAACCGTGGT TTTAGTTGCC CTGAAAGGTT TTACAGCATT GTTGAGGCGG 



\ 



02601A8 



alul 
sad 
hgiAI 
bspl286 

hgal rsal mnll banll 

541 CCATTGACGC AAATGGGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG CAGAGCTCGT 
GGTAACTGCG TTTACCCGCC ATCCGCACAT GCCACCCTCC AGATATATTC GTCTCGAGCA 

scrFI 
sau3AI hgal 

dpnl bstNI ahall fokl mnll mboll 

601 TTAGTGAACC GTCAGATCGC CTGGAGACGC CATCCACGCT GTTTTGACCT CCATAGAAGA 
AATCACTTGG CAGTCTAGCG GACCTCTGCG GTAGGTGCGA CAAAACTGGA GGTATCTTCT 
Begin RNA 



sau96I 
avail 
nlalV 

scrFI 

ncil 



scrFI 
ncil 
haelll 
xmalll 
eael 
fnu4HI 



mspl sau3AI mnll thai mspl 

hpall dpnl bgll sacll hpall thai hinfl 

661 CACCGGGACC GATCCAGCCT CCGCGGCCGG GAACGGOXSCA TTGGAACGCG GATTCCCCGT 
GOXSGCCCTGG CTAGGTCGGA GGCGCCGGCC CTTGCCACGT AACCTTGCGC CTAAGGGGCA 

bstXI 
sau96I 

rsal hinfl haelll styl 

721 GCCAAGAGTG ACGTAAGTAC CGCCTATAGA GTCTATAGGC CCACCCCCTT GGCTTCTTAT 
CGGTTCTCAC TGCATTCATG GCGGATATCT CAGATATCCG GGa?GGGGGAA CCGAAGAATA 

hael 
eael 

sau3AI ball 
dpnl sau3AI 
xholl alul dpnl 

nlalV ddel mnll xholl 

bamHI rsal hindlll bglll haelll 

781 GCGACGGATC CCGTACTAAG CTTGAGGTGT GGCAGGCTTG AGATCTGGCC MACACTTGA 
CGCTGCCTAG GGCATGATTC GAACTCCACA CCGTCCGAAC TCTAGACCGG TATGTGAACT 
IgE synthetic lOOmer 

fnu4HI 
bbvl 

fokl PStI 
841 GOXSACAATGA CATCCACTTT GCCTTTCTCT CCACAGGTGT CCACTCCCAC GTCCAACTGC 
CACTGTTACT GTAGGTGAAA CGGAAAGAGA GGTGTCCACA GGTGAGGGTG CAGGTTGACG 

Pstl-Clal 
converter 



clal 
sauBAI 
dpnl 
pvul 

alul taqi taqi 
901 AGCTCGGTTC GATCGATAA 
TCGAGCCAAG CTAGCTATT 



Fig.17(cont.) 
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Fig.19. 

aim 
sad 
hgiAI 
bspl286 
banir 

taqi spel 
1 TTCGAGCTCG CCCGACATOX; ATTATTGACT AGTTATTAAT AGTAATCAAT TACGGGGTCA 
AAGCTCGAGC GGGCTGTAAC TAATAACTGA TCAATAATTA TCATTAGTTA ATGCCCCAGT 
from pPMLCMV beginning to Hindi I I, enhancers and promoter 

scrFI 
bgll bstNI 
sau96I 

thai haelll 
61 TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC TTACGGTAAA TGGCCCGCCT 
AATCAAGTAT CGGGTATATA CCTCAAGGCG CAATGTATTG AATGCCATTT ACCGGGCGGA 



ahall 
aatll 

121 GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT TCCCATAGTA 
CCGACTGGCG GGTOXSCTGGG GGC6G6TAAC TGCAGTTATT ACTGCATACA AGGGTATCAT 

ahall 

aatll bgll 
181 ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGA6T ATTTACGGTA AACTGCCCAC 
TGCGGTTATC CCTGAAAGGT AACTGCAGTT ACCCACCTCA TAAATGCCAT TTGACGGGTG 

ahall 

rsal ndel rsal aatll 

241 TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT 
AACCGTCATG TAGTTCACAT AGTATACGGT TCATGCGGGG GATAACTGCA GTTACTGCCA 



bgll 

' sau96I scrFI nlalll 
haelll bstNI rsal rsal 

301 AAATGGCCCG CCTGGCATTA TGCCCAGTAC AOXSACCTTAT GGGACTTTCC TACTTGGCAG 
TTTACCGGGC GGACCGTAAT ACGGGTCATG TACTGGAATA CCCTGAAAGG ATGAACCGTC 

nlalll 
styl sfaNI 

snaBI ncol hphi rsal 

361 TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA GTACATCAAT 
ATGTAGATGC ATAATCAGTA GCGATAATGG TACCACTACG CCAAAACCGT CATGTAGTTA 



0:^60148 



is 



ahall 

hinfl aatll 
421 GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT TGACGTCAAT 
CCCGCACCTA TCGCCAAACT GAGTGCCCCT AAAGGTTCAG AGGTGGGGTA ACTGCAGTTA 



nlalV 
bani 

481 GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA AATGTCGTAA CAACTCCGCC 
CCCTCAAACA AAACCGTGGT TTTAGTTGCC COXSAAAGGTT TTACAGCATT GTTGAGGCGG 

alul 
sad 
hgiAI 
bspl286 

hgal rsal mnll banll 

541 CCATTGACGC AAATGGGCGG TAGGC!GTGTA CGGTGGGAGG TCTATATAAG CAGAGCTCGT 
GGTAACTGCG TTTACCCGCC ATCCGCACAT GCCACCCTCC AGATATATTC GTCTCGAGCA 

scrFI 
sau3AI hgal 

dpnl bstNI ahall fokl mnll mboll 

601 TTAGTGAACC GTCAGATCGC CTGGAGACGC CATCCACGCT GTTTTGACCT CCATAGAAGA 
AATCACTTGG CAGTCTAGCG GACCTCTGCG GTAGGTGCGA CAAAACTGGA GGTATCTTCT 
Begin 

scrFI 

sau96I ncil 

avail haelll 
nlalV xmalll 
scrFI eael 
ncil fnu4HI 
mspl sau3AI mnll thai mspl 

hpall dpnl bgll sacll hpall hphi thai hinfl 

661 CACCGGGACC GATCCCAGCC TCCGCGGCCG GGAACGGTGA TTGGAACGCG GATTCCCCGT 
Ga?GGCCCTGG CTAGGGTCGG AGGCGCCGGC CCTTGCCACT AACCTTGCGC CTAAGGGGCA 

clal 

alul sau3AI 
fnu4HI dpnl 
bbvl mspl taqi taqi 
tthllll psti hpall pvul 

721 GCCAAGAGTG ACGGTGTCCA CTCCCACGTC CAACTGCAGC TCCGGTTCGA TCGATAA 
CGGTTCTCAC TGCCACAGGT GAGGGTGCAG GTTGACGTCG AGGCCAAGCT AGCTATT 
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